Announcing the Final Examination of Ramin Mehran for the degree of Doctor of Philosophy Time & Location: November 1, 2011 at 3:00 PM in HEC 101 Title: Analysis of Behaviors in Crowd Videos
Abstract
In this dissertation, we address the problem of discovering and representing group activities of humans and objects in a variety of scenarios commonly encountered in vision applications. The overarching goal is to devise a discriminative representation of human motion in social settings that captures a wide variety of human activities observable in video sequences. Such motion emerges from the collective behavior of individuals and their interactions, and it is a significant source of information for applications such as event detection, behavior recognition, and activity recognition. We present new representations of human group motion for static cameras and propose algorithms for applying them to a variety of problems. First, we propose a method to model and learn the scene activity of a crowd using the Social Force Model, applied here for the first time in the computer vision community. We present a method to densely estimate the interaction forces between people in a crowd observed by a static camera. The activity patterns of the objects in the scene are modeled as volumes of interaction forces. Second, we propose a method based on the Lagrangian framework for fluid dynamics, introducing a streakline representation of flow. Using this framework, we distinguish different group behaviors such as divergent or convergent motion and lanes. Finally, we introduce flow potentials as a discriminative feature for recognizing crowd behaviors in a scene. Results of extensive experiments are presented for multiple real-life crowd sequences involving pedestrian and vehicular traffic. The proposed method uses optical flow as the low-level feature and performs integration and clustering to obtain coherent group motion patterns.
However, we observe that in crowd video sequences, as well as in a variety of other vision applications, the co-occurrence and inter-relation of motion patterns are the main characteristics of group behaviors. In other words, the group behavior of objects is a mixture of individual actions or behaviors in a specific geometrical layout and temporal order. We therefore propose a new representation for group behaviors of humans using the inter-relation of motion patterns in a scene. The representation is based on a bag of visual phrases built from spatio-temporal visual words. We present a method to match the high-order spatial layout of visual words that preserves the geometry of the visual words under similarity transformations.
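The streakline idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the dissertation's implementation; it only shows the textbook definition of a streakline: the current positions of all particles released from one seed point at successive times and carried along by the flow. The function names `advect`, `streakline`, and `uniform_flow` are hypothetical, the explicit Euler step is an assumption, and the synthetic flow stands in for per-frame optical flow from a video.

```python
import numpy as np

def advect(points, flow_fn, t, dt=1.0):
    """Move each particle one explicit Euler step through the velocity field at time t."""
    return points + dt * flow_fn(points, t)

def streakline(seed, flow_fn, n_steps, dt=1.0):
    """Return final positions of the particles released from `seed`, one per step.

    Row i is the particle released at step i, so row 0 is the oldest particle.
    """
    seed = np.asarray(seed, dtype=float)
    particles = np.empty((0, 2))
    for step in range(n_steps):
        # Release a new particle at the seed, then advect all particles together.
        particles = np.vstack([particles, seed])
        particles = advect(particles, flow_fn, step * dt, dt)
    return particles

def uniform_flow(points, t):
    """Hypothetical steady flow: every point moves one unit per time step in +x."""
    v = np.zeros_like(points)
    v[:, 0] = 1.0
    return v

line = streakline(seed=(0.0, 0.0), flow_fn=uniform_flow, n_steps=4)
```

In a steady flow like this one, the streakline coincides with the streamline through the seed. The two differ only when the flow changes over time, which is the usual situation in crowd video; there `flow_fn` would sample the optical flow of the frame at time `t`.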
Similar resources
Announcing the Final Examination of Jingen Liu for the degree of Doctor of Philosophy Time & Location: November 3, 2009 at 9:00 AM in HEC 101 Title: Learning Semantic Features for Visual Recognition
Recently, bag of visual words (BoVW) representation, in which the image patches or video cuboids are quantized into visual-words (VWs) based on their appearance similarity, has been widely and successfully explored. The advantages of this model are that no explicit detection of object parts and their tracking are required, and it is efficient for matching. But, the performance of BoVW is sensit...
Announcing the Final Examination of Rochelle Elva for the degree of Doctor of Philosophy Time & Location: July 5, 2013 at 2:00 PM in HEC 450 Title: DETECTING SEMANTIC METHOD CLONES IN JAVA CODE USING METHOD IOE-BEHAVIOR
The determination of semantic equivalence is an undecidable problem; however, this dissertation shows that a reasonable approximation can be obtained using a combination of static and dynamic analysis. This study investigates the detection of functional duplicates, referred to as semantic method clones (SMCs), in Java code. My algorithm extends the input-output notion of observable behavior, us...